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ABSTRACT 


The problem of distortion in underwater communications neculiar 
to free divers and techniques for analysis of sneech wave forms are 
discussed. The Fast Fourier Transform algorithm, selected to analyze 
shifts in formant frequencies due to restricted oral cavities, hiah 
ambient pressures, and forced speech is discussed. The Fast Fourier 
Transform is used to analyze a vowel sound and show that the exnected 
shifts do occur. Recommendations are made for extendina the techniques 
to all non-noise like sounds and breathing mixtures other than compressed 


air. 
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I. INTRODUCTION 


Man's future needs dictate the development of the vast areas of 
the continental shelf for food, raw materials, and livina space. To 
explore, map, and colonize this unknown a myriad of new equipment must 
be developed for explorers of this "last frontier." Today man has the 
capability to communicate reliably over the vast reaches of snace; but 
free divers (Divers with no attachment to a habitat or surface station — 
untethered) operating in close proximity cannot communicate reliably to 
each other. One UDT officer summed up the state of the art by sayina, 
"reliable communications between free divers consists of tugs on buddy 
lines." Unsatisfactory undersea communications result from poorly 
desiaqned oral cavities, high ambient pressures, poor electronics, and 
exotic gas breathing mixtures. Current equipment has been further 
complicated by designers unfamiliar with the environment and hazards 
associated with the free diving world; few designers have taken into 
consideration the user's emotional response to the apparatus in the 
underwater environment. 

In 1959-60, the problem of inadequate communications between free 
divers was identified in a specific onerational requirement of Navy UDT 
and Marine Corps Force Reconnaissance Divers. Since that time, the 
U. S. Navy and U. S. Marine Corps have been moving on separate paths to 
develop a reliable communicator for the free diver. Presently available 
equipment is far from an optimum desiqn for many reasons — poor overall] 
intelligibility being of prime importance. In short, the state of the 
art is unacceptable. This unacceptable condition, personal interest, 


and the requirement of the Armed Forces for more sophisticated devices 
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generated the oriainal topic of this research — an investiaation into 
speech intelligibility of free diver underwater communicators. After 
visitina with nersons enaaged in Navy sponsored work in this realm, it 
became apparent that more was involved than just better electronics. 
Inherent in improvina underwater communications devices is the need 

for better understanding of what happens to speech in the undersea 
environment. This fact made jt necessary to pursue a working knowledae 
of the disciplines of linguistics and speech processing. Repeated 
conferences with the Communications Sciences Laboratory, University of 
Florida stimulated the curiosity and study of these disciplines. 

Past speech processing has taken the form of qualitative analysis 
using analog equipment, such as spectographs, and intelliaibility 
studies using human listeners; however, few quantitative studies have 
been undertaken to isolate the major distortion problems. Many compar- 
ative studies of hard line and free diver communication systems have 
been and are being carried out, primarily using human speakers and 
listeners to evaluate system intelligibility usina controlled word 
lists [1]. The spectoaraph also gives a measure of quantitative anal- 
ysis to highly trained speech analysts. 

The use of frequency analysis (a common tool of the electrical 
engineer) for simple waveform analysis is a proven technique. In the 
past, this technique, transforming the signal under consideration to 
the frequency domain using the Fourier transform required too areat an 
expenditure of time and human effort for speech waveforms. The advent 
of digital computers with vast memory capabilities, now makes this 
technique a feasible method for speech waveform analysis. Transfor- 


mation of the speech waveforms is accomplished using the discrete 
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Fourier transform. This transformation is reliable if the sianal is 
band limited and occurs for a finite period of time. The Fast Fourier 
Transform (FFT) algorithm provides a rapid method for calculating the 
discrete Fourier transform of such siqnals [2]. This algorithm has 
many applications to digital filtering and controlling real time proc- 
esses in the frequency domain. The value is apparent when considering 
Signals that require 8,000 samples to provide complete coverage of 
frequencies of interest. It provides a real time saving of 300 to 1 
over previous algorithms; analysis that previously took hours now takes 
seconds. 

This is the approach taken tn this paper — investigation of dis- 
tortion in smal] underwater communicators using the FFT to nrocess 
diver's speech. The final value can only be imagined; the analysis can 
be applied to isolate distortion in helium speech, build better helium 
speech unscramblers and help to prevent loss of vital undersea explorers, 


such as Berry Cannon. 


A, FREE DIVER COMMUNICATION 

Whenever man goes underwater, he needs communication. Tasks under- 
water are made easfer and safer if communications exist between divers 
and/or surface stations. The military diver uses communications to 
carry out complicated tasks, to penetrate enemy held areas, and to effect 
rendezvous with underwater vehicles. Reliable underwater communications 
do not exist for all cases of interest to the U. S. Navy and U. S. Marine 
Corps. 

Communications between free divers is a relatively new need; but as 
man goes deeper for longer periods of time the requirement will increase 


in importance. Sonar has been used for communications between 


ee 


sSubmersibles and/or surface vessels for many years; small sonar sets 

are now performing the same service between divers, submersibles and/or 

surface stations. Many problems exist in the diver's case that are not 

of consideration in submarines. The more important include the addition 

of restricted oral cavities to the vocal tract, increased ambient pres- 

sures, increased noise levels, and the uses of exotic breathing mixtures. 
1. Current ‘Equipment 

Initially, efforts to provide the diver with communication 
involved amolifving the voice and projecting it into the underwater 
environment. Water noises near transmission frequencies, low power, and 
exclusion of microphones and earphones (for low cost) contributed to 
poor intelligibility and short range making these systems impractical 
for most military uses [3]. Electromagnetic and electric field trans- 
mission have also been tried with limited success at short ranaes. 

Sonar has provided the most successful method of eaumun tein 
to date. Using crystals possessing electrostrictive pronerties, a sound 
pressure or acoustic wave iS generated in to the water and received by a 
Similar transducer. Sets of this design operate around a carrier fre- 
quency of 8KHz-42KHz (the lower carrier producing greater range of 
transmission) and utilize AM-SSB/SC transmission to lower noise recep- 
tion and increase power. The Hydro-Products 811 and Bendix PNC-2 (see 
Figs. 1 and 2) underwater communicators are industry's latest efforts in 
the field. Characteristics of these sets are shown in Table I. 

Changes occur in these sets when placed on the diver. The omni- 
directional transducer becomes somewhat directional due to acoustical 
dampening by the lungs, tanks, and the neoprene wet suit worn in cold 


water. Two divers operating in close proximity can not communicate when 
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SET 


Carrier Frequency 
Electric Power 
Into Water 

Range 


Modes 


Transducer 


Pattern 


Microphone 


Receiver 


Intelligibility 


Bandwidth 


* Manufacturer's Data 


TABLE I 


HP 81] 
8.0875 KHz * 
AM-SSB/SC 
.5 watt - 


Nominal 800 meter * 
Voice 
Electrostrictive 
(barium titamate) 
Omnidirectional 


Oral Cavity 
Microphone 


Bone Conduction 
(one) 


No information 
(under test at 
U. of Fila.) 


300-3000 Hz * 
(2700 Hz) 





PONC-2 
8.0875 KHz * 
AM-SSB/SC 
.6 watt . 


500 meter minimum * 


Voice, Tone, and 
Interrupvted Tone 


Electrostrictive 
(lead zirconate) 


Omnidirectional 


Bone conduction 
( two ) 


Bone Conduction 
(two ) 


No information 
(under test at 
U. of Fla.) 


300-3000 Hz * 
(2700 Hz) 


the lung cavity is in the line of transducers. This is a serious 
defect when one user 7s unaware of the relative location of the second 
user. 

The communicator bandwidth operates on the telephone principle — 
300-3000 Hertz bandwidth will provide high intelliaibility. Studies 
indicate for shallow depths that this bandwidth will provide intelli- 
gibility of 95+% [1]; but this is not necessarily true for all conditions 
encountered by the diver. 

Evidence points to bone conduction as the primary source of 
hearing underwater [4]. All currently available communicators utilize 
bone conduction reception; one of the tested communicators used bone 
conduction transmission in place of the normal microphone in the oral 
cavity. Some tests indicate a definite loss in intelligibility when 
using bone conduction transmission; but no quantitative results support 
these conclusions. The use of a single hose requlator with bone con- 
duction transmission caused the signal to be obscured by bubble noise 
passing the conductors. 

These problems are not unique to the tested communicators but 
a far more important set of problems are those common to all communi- 
cation systems for free divers. 

2. Problem Areas 

The most complex problem is that of speech distortion caused by 
the addition of a small] cavity over the mouth. To date little basic 
research has been attempted in an effort to design an optimal oral 
cavity. The design must balance two conflicting variables — small 
volume to prevent low resonances near formants or articulated sound and 


buildup of carbon dioxide, and large volume to prevent increased 
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pressure from affecting the sneaker in articulation. The Bioengionics' 
Nautilus oral cavity (see Figure 3) was used to aid speaker articula- 
tion in this research. This oral cavity has been used successfully to 
increase intelligibility of systems in other tests and nrovides an 
excellent trade off in cavity design and diver safety [5]. 

Increased operating depth results in increases of ambient pres- 
sures. This changes the characteristics of the resonant cavities and 
the density of the breathing gas. When a diver is operating at pres- 
sures of four atmospheres (100 feet) or greater, the voice attains a 
nasal quality and causes considerable loss of intelligibility [5]. 

When exotic breathing gases are substituted for the primarily 
nitrogen-oxygen gas, the voice attains a quality best described as the 
"Donald Duck" effect. The use of exotic gases causes formant shifts 
due to the increased sound velocities. Unfortunately the shifts are 
non-linear; this makes correction a complex problem. The amount of 
distortion due to these effects is under study by conventional tech- 
niques by the Communications Sciences Laboratory, University of Florida 
[1], and other agencies [6,7,8]. 

Physiological problems are of interest to ensure that human 
engineering of communicators are accentable to the diver when operating 
in an alien environment and commensurate with desired safety. The 
diver feels an almost personal relationship with his equipment. The 
communicator must require little or no change in equipment for the diver 
to accept it unhesitatingly. A prime example is the addition of the 
oral cavity, a much needed item for proper articulation. The original 


design of the Bendix PNC-2 excluded use of a cavity for this reason. 
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Desian of workable underwater communtcators is feasible; but a 
great deal of research is necessary before a single system will over- 
come the vast number of problems. The most critical item is development 
of a method to solve the deep depth and exotic gas sneech distortion and 
then microminaturization of the “unscrambler" for inclusion in the free 


diver's equipment. 


Be SPEECH BROCESSING 

To the engineer engaged in speech research, "speech processing" 
implies synthesis and analysis, automatic recognition and speech com- 
pression. The linquist will speak of the acoustical model in speech 
production or perception and/or various articulation and intelligibil- 
ity testing. The language of speech research is foreign to the beainner; 
therefore, Appendix A includes a glossary of speech and linguistic terms. 

As previously mentioned, the requirement exists for a technique 
that will enable the speech analyst to accurately determine the effects 
of an alien environment, such as that encountered by the diver, on the 
human voice. 

Some of the techniques used for speech analysis are summarized with 
a brief description of the new "wrinkle" in analysis following. 

1. Analog Techniques 

a. Recognition Analysis 
The vowel sounds have been the subject of intense research. 

The most common result has been the classification of the vowels by 
their formants, primarily the relation of the formants to each other, 
and has been labeled "Visible Speech" [9]. The device most commonly 


used is the "acoustic spectogram" where a visual representation of the 
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spoken messaae or phoneme is displaved in frequency versus time. 
The intensity, a third parameter, is displayed by the darkness of the 
plot. Thus, the transient spectrum can be viewed. A spectoaram of the 
word "at" is shown in Fig. 4. The chief disadvantage of the spectoaram 
is the lack of resolution. 
b. Power Spectrum Measurements 
The analog spectrum analyzer is a convenient tool to view 
the spectrum of a signal instantaneously. It works on the principle 
of a constant width bandpass filter with a variable center frequency; 
however, high precision measurements are not possible. Another tech- 
nique that borders on being classed as digital is the use of a bank of 
narrow bandpass filters with increasing bandwidth as their center fre- 
quency increases. Until recently, size and cost have limited their 
use, with resolution stil! the determining factor. 
c. Waveform Analysis 
The fact that vowel sounds can be characterized by their 
waveform asymmetry has been noted. Pairs of phonemes can be compared 
and separated on the basis of the magnitude and polarity of the output 
of a demodulator [10]. 
d. Articulation and Intelligibility Tests 
To evaluate communication systems, the articulation tests 
were widely used during World War II. The results of the tests aided 
hardware design. Intelligibility testing takes the form of reading 
from phonetically balanced word lists and assigning a nercentaqge score 
for the correct responses of human listeners. The consistency of such 
tests demonstrates their reliability and makes them the most commonly 
used; however, the search for optimum lists of sounds, words and phrases 
continues [5]. 
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oe Digital Techniques 


The analysis of speech by digital techniques owes a great 
deal of its popularity to the interest in the development of the 
"vocoder", or voice coder. Linguists and psycholinguists have been 
a trifle reluctant to master the techniques of the digital world [11]. 

Digital Spectrum analyzers prior to 1965 consisted of narrow 
band digital filters. The number of digital filters necessary for 
sufficient bandwidth and resolution required extensive memory and 
computing time; an excessive amount for speech waveforms. 

a. The Fast Fourier Transform 

The capability for high resolution spectral analysis was 

not available until recently discovered algorithms were imnlemented 
[12]. The algorithm of interest in this paper is the Fast Fourier 
Transform (FFT) [2]. It is an efficient algorithm for computing 
the "discrete Fourier transform’. The number of operations is reduced 
from Ne to 2NlogoN, where N is the number of samoles. Thus, it can 
be used for offline spectral analysis of processes. The advantage 
of the FFT is illustrated by comparing the number of operations re- 
quired for N = 8192 samples. Ne is in excess of 67 million compared 
to 213 thousand for the FFT. Such a saving in overations vastly 
reduces the computing time. 

b. Spectral Measurements 

With the FFT as a tool available in generating the Fourier 

coefficients, several avenues of analvsis are plausible. The avenue 
chosen is the power spectrum. Other spectra, such as the cesptrum, 


are available but were not considered. The FFT lends itself to such 
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uses and could be apnlied in future research in this area. Bv claimina 
that the power spectrum is available, it must be clear that the spec- 
trum is only an estimate. How good the estimate is depends upon the 
methods of imnlementing the algorithm. First, the siqnal is of finite 
length, hence the number of samnles is finite. Second, the signal 

is bandlimited. The assumption is made that the siqnal is a member 


of a stationary process. 
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TT. DATA COLLECTION 


After consultation with authorities in speech processing and 
underwater communications, and after considering time, funds, and 
human effort available, the decision was made to proceed to collect 
data to support two independent projects. This decision was influ- 
enced by the loan, free of cost, of the two underwater communicators 
used in the data collection. Certain control sounds and words, shown 
in Table II,were recorded to support the digital speech processing. 
Two vowel and four consonants were arbitrarily chosen. One sound 
of each main subgroun was included [9]. This data was originally 
intended to assist in isolating distortion in the two communicators. 
The second project was designed to assist in judgina the relative 
intelligibility of the two systems under various conditions. Two 
Campbell word lists were recorded by each diver. The Camobell word 
lists are monosyballic, phonetically balanced lists of 25 words each, 
selected randomly from a master list of 500 words. Table [II is an 
example of such a list. 

Oriaqinal nlans called for data to be taken in two locations, an 
anechoic pool free of outside interference and the ocean to nrovide 
a noisy environment with varying conditions of range and denth. Sea 
conditions and boat availability in January and Februarv, 1969 
allowed only eight (8) days to be used for taking data; although the 
effort was made daily. Finally after 30% of the desired data had been 
collected, theft of equipment, lack of a stable platform in heavy seas, 


poor ocean visibility, and shortage of personnel dictated a change to 
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Phoneme 


/AE/ 
/e/ 
/P/ 
/G/ 
[Sf] 
Les 
/TH/ 
/V/ 


DOLLS 
NUTS 
BOOK 
JUMP 
ART 
CARVE 
OR 
ARE 


TABLE [1 


Control Sounds and (lords 


Word 
AT 

MET 
PAY 

GO 
SHOULD 
FIRST 
THIN 
VOTE 


TABLE III 


Class of Phoneme 


Vowe | 

Vowe] 

Voiceless Ston 
Voiced Ston 
Voiceless Fricative 
Voiceless Fricative 
Voiceless Fricative 


Voiced Fricative 


Sample Campbell] List (P-7) 


AIM 
CHAIR 
SMOOTH 
ARM 
POOR 
HIGH 
SKIN 
LIE 
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COOK GIVE 
AID 

AM 

ACE 

UP 

YES 

THREE 

HURT 


a quiet, fresh water environment having a stable platform. Two under- 
water communicators, the Hydro-Products 811 and the Bendix PONC-2 were 
used to transmit the data to an AN/WONC-1A communication set. Charact- 
eristics of this set appear in Table IV. This receiver fed a Precision 
Instruments DA 6200 tave recorder. The data was recorded at 37.5 ips 
and stored on magnetic tape. All recording used frequency modulation 
to allow for subsequent linear frequency translation by changing tane 


recorder sneed. 


A. ANECHOIC POOL 

The anechoic pool located in Room 025 of Snanagel Hall, Naval 
Postgraduate School, was utilized to provide the noise free environ- 
ment. The diver was dressed in normal diving equipment as shown in 
Fiq. 5 and 6. Initially the diver svoke into free space as the 
control sounds, control words, and two Campbell word lists were 
recorded. Then an oral cavity was placed in position and the proce- 


dure repeated. A block diagram of the test setup is shown in Fia. 7. 


B. OCEAN RANGE 


The ocean range is located in Monterey Bay 1500 meters off Del Monte 
Beach. The location is shown in Fig. 8. The underwater nortion of 


this range is still located as indicated. The range is located in 100 
feet of water over a flat sandy bottom to minimize bottom reflection 
interference. Fig. 9 and 10 show the range layout. A diver platform 
is located at the main buoy to allow orientation of the diver toward 
the receiving transducer. This orienting device prevents signal loss 
due to attenuation by the lung cavities, wet suits, and other items of 


diver dress. Figure 1] shows a diver in the polatform readv to 
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TABLE IV 


Characteristics of AN/WQC-1A Communication Set 


OPERATION AM/SSB-SC 

FREQUENCY Sude= 11 ocKHz 

MODES Receive, Transmit, Tone 
TRANSDUCER Omnidirectional w/o Baffle 


15 db down 60°-330° w/ Baffle 


BANDWIDTH 300-3000 Hz 
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Diver in Anechoic Pool 
Without Oral Cavity 


FIGURE 5 





Diver in Anechoic Pool 
With Oral Cavity 


FIGURE 6 
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Diver in Orientation Platiorm 


FIGURE Ill 
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communicate. Fiaure 12 is a block diaqram of the test set-up. Note 
that the tane recorder is mounted on an unstable nlatform (the boat) 
and must remain within short ranae (about 15 meters) of the desired 
buov range marker. Table V indicates the desired conditions for 


recording data. 


C. FRESH WATER RANGE 

The fresh water range was set up in San Vincente Reservoir, San 
Dieqo County, California. Figure 13 shows the site. It differs from 
the previously described ocean ranqe in four ways. 


1. The granite walls and concrete dam face made receiving 
transducer orientation critical to prevent side reflections and 
minimize reverberation. This was accomplished bv susnendina the 
transducer on one line and using two quidelines to insure exact 
pos tmening. 


2. The tape recorder was placed on a stable barae and the 
diver moved to different locations, thus reversing the roles in the 
ocean. 


3. Nuiet, fresh water replaced the noisv ocean environment. 


4. Diver orientation was accomnlished using a comnass. 
Table V shows the desired conditions for data collection. Eight 
davs were required for movement to the reservoir, set un on the 
barqe, collection of data and return to Monterey, California. 
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Sen Vicente Reservoir Test Site 


rIGURE 13 


oF 


TABLE V 


Recording Conditions for Ocean and Fresh Water Ranges 





DIVER BOND MYATT 
SET PQOC-2 HP SIll PQC-2 Qsiiaeaim 
RANGE DEPTH 
10 20 X Y if X¥ Y 
10 90 X Y Y X Y i 
10 80 oy y x i 
100 20 X Y Y xa Y 
100 90 Kory i X Y Y 
100 80 ot Y me i 
300 90 y Y Y i 
500 90 i i i ul 


X — DATA RECORDED IN OCEAN RANGE 
Y — DATA RECORDED IN FRESH-WATER RAGE 
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ITT. DATA PROCESSING 


The instruments available at the Naval Postgraduate School for 
the processing of the data include the IBM 360/67 comouter, the SDS 
9300 medium sized comouter with a hybrid interface to a COMCOR 5000 


analog computer. 


A. ANALOG TO DIGITAL CONVERSION 

The method of processing the data depended largelv on the cana- 
bilities of the hybrid computer setup of the COMCOR 5000 and the SDS 
9300. In order to digitize the data from the tapes, the samnlina rate 
of the analog to digital (A-D) converter had to be at least twice as 
high as the largest frequency of interest (the Nyquist frequency). 
Unfortunately, the present write scheme of the SDS 9300 limits the 
samnle rate to 2500 Hertz. This virtually eliminates the use of the 
A-D converter for the entire audioband. It was determined that the 
frequencies of interest for this exveriment slightly exceed the pass- 
bands of the two communicators, 1.e., 300-3000 hertz; a vassband of 
200-4000 hertz was used. 

The sampling frequency dilemna was circumvented bv utilizing the 
features of the FM tape recorder. The PI 6200 instrumentation tape 
recorder will record at three sneeds - 37.5 ins, 3.75 ips, and .375 
ios. The center frequency of the carrier is governed by the choice of 
the tane sneed - 50,000 hertz, 5,000 hertz and 500 hertz, resnectively. 
Suitable lownass filters are switched into the circuitry for these 


speeds. At 37.5 ips the tape recorder input yields O db gain from 


oe 


dc to 10,000 hertz, with a signal to noise ratio of 42 db. By olavina 
back at 3.75 ios, the discriminator frequency is divided by a factor 
ten, as are the siqnal frequencies. This linear frequency translation 
allows a samnle rate of 800 hertz to cover the translated audioband. 
The choice of parameters for sampling the data was hampered by 
the necessity for trade-off in frequency and time resolution, often 
referred to as the analogy to Heisenberas “uncertainty nrincinle" [13]. 
Iqnorina statistical stability and those terms discussed in references 


on nower spectra, the frequency resolution, Af, is defined as: 


Af = fer as 
where Lis = sampling frequency 
$= number of samnles 


The dilemna is obvious. The larger the number of samnles, the smaller 
the separation between spectral lines, hence, greater resolution. The 
larger MN also extends the time duration the siqnal must be samnled, 
hence a loss of time resolution. To nrevent aliasing, the samnling 
frequency must be at least twice the highest frequency of the sianal. 
The larger sampling frequency reduces the frequency resolution. 

The solution was to investigate the data for both time and fre- 
quencv resolution. The time resolution was increased by using a lower 
number of samples for calculating an "evolutionary spectrum" and a 
larger number of samples for a spectrum of high resolution. 

The samnle rate was selected to be 2048 hertz. This frequency was 
selected because the FFT algorithm is most efficient when ie 1s some 
number that is a power of two. It is more than five times the highest 
frequency of interest; i.e., 400 hertz (4,000 hertz translated by a 


factor of ten). 
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The number of samples taken must necessarily be a number that is 
a power of two for the FFT algorithm. Thus, the number of samoles taken 
for each record was 8192. This allowed the siaqnal to be samnled for 
a period of 0.4 seconds. The maximum resolution possible is 0.25 
hertz, translated by the factor of ten to 2.5 hertz, real time. 

To samnle the data, the tape recorder output was filtered by a 
Krohn-hite adjustable bandpass filter, Model 310 A. The character- 
istics of this filter are shown in Table VI. The frequency limits 
were 20 and 400 hertz. The signal was then amnlified by a factor of 
30, and digitized with the clock rate controlled bv the Wavetek 116 
oscillator. The block diagram of Fig. 14 shows the sampling setun. 

The signal was monitored by audio means and visually on an oscil lo- 
scope to determine the proper time to commence diaitizing. Herein 
lies a fundamental weakness in this experiment; it will be discussed 
more fully in the analysis section. Briefly, no control was used to 
ensure that the sampling process started at the same instant the 
nhoneme commenced. 

A vast amount of data was recorded for analysis. With time 
becoming a factor, it was necessary to limit the number of nhonemes 
examined. 

The phoneme selected was |AE|, the vowel sound in "at" - a front 
vowel. The same phoneme was sampled for the following conditions - 
each communicator at a range of ten meters and depth of 8f) feet, and 
each communicator in the anechoic pool with and without oral cavity. 

The sampled data from the SDS 9300 was placed on magnetic tape. 
The output was seven track in 24 bit octal format. The samnle routine 


was an in-house routine developved by former NPGS students [14,15]. 
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TABLE VI 


Characteristics of Model 310A Krohn-hite Filter 





KROHN-HITE MODEL 310C 
NORMALIZED FILTER RESPONSE 


Attenuation: 24 db/octave below -]2db point 


Input Characteristics: 6 Megohms in parallel with 50 unf 
5 volts rms maximum 


Output Characteristics: 200 ohms, 12 milliwatts 


42 


LAdLAO 
didV1 
VW 








(ade sudqul 
pLuqdy) 





vl 3uNdI4 
weubelg 490[g bur, dwes 





| (407 euedwWo9 
03 ynduL) 
AIO 19 








me 
000S19 
YOIWOd cS 
a i a O€=N1V9 
(00G1 09 


qyndut jeubls) 


j 


re re I a aD a 


4 


asi 
SSVdQNV9 
JLIHNHOUN < 





Y3qY093u 
ddV1 
0029 Id 





43 


B. INTERFACING 

The output of the SDS 9300 is written in seven track. As is the 
case with all computers when trying to mate with the IBM 360, the 
interface is not as smoothly refined as might be exnected. The seven 
track tape must be converted to nine tracks and the 24 bit, octal 
format converted to 32 bit, hexadecimal format. The program required 
is listed in Appendix B. 

The tane drives of the IBM 360 required perseverance on the part 
of the onerators. The tane from the SDS 9300 was accented roughly 
one out of five tries. 

No attempt was made to store this data on a direct access device. 


However,this seems desirable to conserve computer time. 





C. DIGITAL PROCESSING 

The data was stored on magnetic tane in 32 bit words. The sub- 
routine Harm, vart of the Standard Scientific Package for the IBM 360/67, 
was used to calculate the estimates for the Fourier coefficients of 
the signal. This routine is based on the Cooley-Tukey algorithm. 

As previously mentioned, the entire 8192 data noints could be used 
to compute a snectrum of high frequency resolution but with time uncer- 
tainties. Alternately, only a portion of the data points could be 
used with decreased frequency resolution but with increased certainty 
in the time domain. 

The techniques complement each other. The technique chosen comnuted 
a) a power spectrum using the entire 8192 points, and, b) to assist in tH 


analysis of this spectrum,comnuted a transient spectrum for the first 


a4 


2048 points. The two spectra are titled "spectrum" and "transient 
spectrum." The snectrum has a resolution of 2.5 hertz. The transient 
spectrum has a frequency resolution of ten hertz. 

The ontion exists whether or not a smoothing function is necessary. 
The need for a smoothing function might be described by mentally 
visualizing the side lobes of the sample function. These side lobes 
cause "leakage' from one sample to another. A smoothing function 
such as the now familiar "Hanning spectral window" reduces the side 
lobe leakage from ewe) to ee [16,17]. Whether or not the 
smoothing function is desirable denends on the tyne of siqnal. The 
literature says that noise-like signals with relatively flat spectrum 
are not suitable for smoothing functions, but when the signal is non- 
noise-like and the nosition of signal enerav in the frequency domain 
1S more important than the amount of energy, then a smoothing function 
1S appropriate. The "Hanning window" was used. The estimates of the 


Fourier coefficients were overated on as follows (for '! coefficients), 


A(k) = -%a(k+1) -% a(k-1) + % a(k) 
B(k) = -% b(k+1) -% b(k-1) + % b(k) 
A(O) = %(a(0) - a(1)) 

BCC) ae=— 0 eal eed 


At the exnense of a reduction in "statistical stabilitv", this 
window was used; its time domain equivalent beina, 

A = el COS / 1) ex) eral ms Oy Re eee | 
A second smoothing oneration was performed after the coefficients of 
the power spectrum were computed. 
\ + B(k)é 


= A(k)” + B(k 
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This is again the familiar "Hanning window" [17]. Since the main 
concern is that the estimates are anpronriate to the riominal fre- 
quencies, the linearly hanned spectrum suits its purnose. 

The power spectra were nlotted on a logarithmic scale, allowing 


a forced scale suitable for the entire dynamic ranae of data noints. 
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TV. ANALYSIS 


The phoneme selected for study was not an arbitrarv choice. The 
noise-like sounds such as fricatives or stops do not lend themselves 
to Fourier analysis [18]. The vowel sounds, however, have a steady 
State quality, with slowlv changing transient spectra. The front 
vowel, /AE/ as in "at", demonstrates a well defined snectrum with 
three important formant frequencies. The literature shows general 


agreement on the values of these formants: 


F, = 660 hertz 
Fo = 1729 hertz 
F. = 2410 hertz [19] 


Associated with these formants are bandwidths of 60, 100, and 120 
hertz, respectively. One reference shows the formants to be somewhat 


lower. 


wT 
il 


579 hertz 


_ 
il 


1691 hertz [20] 


A. FORCED AND NORMAL SPEECH 

In order to establish some sort of reference, a diver's voice was 
recorded on a Kay Missilyzer snectoaranh. Fiqure 15 shows the front 
vowel /AE/ recorded in an anechoic chamber. An Altec 21BR150-2 micro- 
phone was used and the diver spoke in a normal voice. Fiqure 16 1s 
the same sound spoken in a loud, forceful manner, just as if the diver 
were using an underwater communicator. The requirement for the forced 


speech to increase intelligibility has been documented [1]. 
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The formant frequencies for the normal sneech are shown with 
arrows in Figure 15. They generally agree with the values cited 
previously. Figure 16 shows the formants greatly increased. The 
effort to increase the level of sound has caused the diver to lift the 
pitch of his voice in an unnatural manner. The snectrum is not constant 
but rises rapidly to a maximum value for each formant and then slowlv 
decays. This change in the spectrum is of major importance. Further 
verification of this difference in the forced and normal speech nower 
spectral estimates is borne out by Figures 17 (a-c) and 18 (a-c). 
The normal speech power spectral outout closely approximates a line 


spectra where shifts can be easily measured. 


B. FREQUENCY TRANSLATION 

To check the frequency translation of the PI-6200 tane recorder, 
a 384 hertz tone was recorded at 37.5 ios and samnled at the rate 
of 2048 hertz. The power spectrum is shown in Figure 19. Figure 20 
Shows the power spectrum when the tape recorder speed is slowed to 


3.75 1ps. The expected change in frequency by a factor of ten occurred. 


Cae POWER SPEGRRAE ESTIMATES 
1. Without Oral Cavity 
A sample spectrum for the recording made using the commun- 
icators without oral cavity is shown in Figure 2l(a-c). The spectrum is 
cluttered with power at frequencies in the neiqhborhood of the for- 
mants. This clutter is sufficient to vrevent accurate measurement of 
the formants; but should be anticipated in light of the observations 


made concerning the rapid changes in the snectrum in Figure 16. This 
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FIGURE 17(a) 
Normal Speech Spectrum of /AE/, 0-1000 Hz. 
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FIGURE 18(a) 
Forced Speech Spectrum of /AE/, 0-1000 Hz. 


51 


Power in db. 


Power in db. 
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FIGURE 17(b) 


Normal Speech Spectrum of /AE/, 1-2 KHz. 
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FIGURE 18(b) 
Forced Speech Spectrum of /AE/, |-2 KHz. 
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FIGURE 17(c) 


Normal Speech Spectrum of /AE/, 2-3 KHz. 





frequency 


FIGURE 18(c) 


Forced Speech Spectrum of /AE/, 2-3 KHz. 
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FIGURE 19 
384 Hz. Tone at 37.5 ips 
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FIGURE 20 
Tone at 3./5 ips 


384 Hz. 
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Power in db. 


Power in db. 
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FIGURE 21(a) 


Phoneme /AE/, Without Oral Cavity, O-1 KHz. 
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FIGURE 22(a) 


Phoneme/AE/, With Oral Cavity, O-1] KHz. 
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FIG. 21(b) 


Phoneme /AE/, Without Oral Cavity, 1-2 KHz. 
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FIG. 22(b) 


Phoneme /AE/, With Oral Cavity, 1-2 KHz. 
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Phoneme Without Oral Cavity, 2-3 KHz 
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Phoneme /AE/, With Oral Cavity, 2-3 KHz. 
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spectrum was estimated using 8192 samnle points. Theoretically, the 
frequency resolution is 2.5 hertz. In reality, only a rough estimate 
can be made to determine the formants within 50 hertz. Blaming the 
poor results on the manner in which the diver forced the sound, the 
next step was to compute an ‘evolutionary spectrum', (transient 
spectrum). Using the first 2048 samnle points, the theoretical resolu- 
tion is ten hertz. The transient spectrum for the first 2048 points 
is Shown in Figure 23a-c, This covers a signal for 0.1 second. The 
obvious blunder is that no technique was used to ensure that the 
sampling process began precisely at the beginning of the sound. This 
is critical in the transient spectrum, in that an additional uncer- 
tainty has been introduced. That ts} does the samnling start before 
the snectrum reaches the highest values in frequency? If the siqnal 
had been the phoneme recorded in the normal voice, this would not 
nave been critical since the snectrum is flat for a reasonable period 
of time. 

Since the data was already recorded, it was not possible to 
record on an adjacent channel, any form of signal to coincide with 
the start of the sound so that the sample process could have been 
automatically triggered. The alternative was to simultaneously 
monitor the signal by audio and visual means and manually triqger the 
Sampling process. Obviously this technique has flaws and the exact 
point of entry into the data stream is uncertain. 

Even with the ambiguities, the fact that the formants are shifted 


un in frequency due to the forced sound level is apparent. The first 
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Transient Spectrum, Without Muzzle, 0-3 KHz 
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FIG. 24 frequency 
Transient Spectrum, With Muzzle, Q-3 KHz. 
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Transient Spectrum, 80 feet, 0-3 KHz. 


60 


formant can be estimated at 840 hertz, in agreement with Fiaure 15. 
The second formant is centered at 1720 hertz and the third at 2550 
hertz. Bandwidth is extremely hard to estimate. 
2. With Oral Cavity 

A sample spectrum for the recordings made using the commun- 
jcators with the oral cavity is shown in Fiaure 20 (a-c). The litera- 
ture concerned with small, non-radiating enclosures, such as the oral 
cavity used, states that in theory the effect of the muzzle is to 
raise the formants in frequency; however in reality, the distortion 
1s much more complicated. "... The formants may become indistinct 
rather than shift in any direction..." [21]. These observations are 
partially borne out in the spectra referenced ahove. F does appear 
to be indistinct as does Fa. F contains the major concentration of 
power, the center being shifted down instead of un in frequency. The 
inability to massage the data to make concrete observations is 
regretable. 

3. Increased Ambient Pressure With Oral Cavity 

A sample spectrum for the recordings made at 80 feet is shown 
in Figure 26 (a-c). The effect of increased pressure is well documented 
[1,5,6]. The formants should shift upward in freauency in a non-linear 
fashion. "...the first formant is affected more than the second 
formant. It is also observed that the intensity level of voiced 
sounds increased with increasing ambient pressure but that tynical 
noise sounds, such as fricatives, and the burst part of unvoiced stons 
display a drop in intensity relative to voiced sounds ..." [6]. 
This referenced source cites a theoretical expression for computing 


the unward shift in frequency of the first formant. 
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FIG. 26(a) 


Phoneme /AE/, 80 feet, O-] KHz. 
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FIG. 26(b) 


Phoneme /AE/, 80 feet, 1-2 KHz. 
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FIG. 26(c) 
Phoneme /AE/, 80 feet, 2-3 KHz. 


63 


056 


2600 


2800 
frequency 


Ce te 





F oa F 2 ae 2 
wl 


lp (p-1) 


where Fp is the frequency of the first formant at n atmosnheres 





pressure, Fa, is the first formant at one atmosphere, and Fy is the 
resonance frequency of the closed vocal tract at one atmosphere. 
Applvina this expression to the situation under discussion, namely a 
depth of 80 feet, the theoretical first formant shift can be calculated. 
Assuming a resonant frequency of 170 hertz, and using the estimate of 
840 hertz for Fay: with p = 3.42 atmospheres, a 1s comouted to be 
878 hertz, a shift of 38 hertz. Examining the spectrum for 80 feet 

in Figure 26, the first formant can be roughly estimated to be 910 
hertz, significantly larger than expected. The source of this shift 

is uncertain; it could be related to the increased effort of the diver, 
as well as to the increase in pressure. Fo and F are indistinct, 


the power distributed in many frequencies throughout the band. 


V. CONCLUSIONS AND RECOMMENDATIONS 


The original objective of isolating the distortion of the under- 
water communicators was not accomolished. Ths problem narrowed to 
develoninag a tool to examine the sneech power spectrum in detail with 
high resolution. It is felt for non-noise like phonemes that the 
FFT is the tool available to the analyst to perform this high resolu- 
tion analysis. Some of the final massaged output reinforces this 
belief. Line snectra can be the final result of speech wave forms if 
a number of the troubles encountered are corrected in subsequent 
research. To assist in future work a summarv of the troubles encoun- 


tered and recommendations for correction are submitted. 


A. DATA COLLECTION 

1. Equipment 

The proper equipment must be available to sunport the research 

in all phases of data collection and remain available for recall during 
the processina phase. This affects a number of topics, range location, 
etc., but is best illustrated by the necessitv for long term availabil- 
ity of the underwater communicators. In this exneriment, the communi- 
cators were on short term loan and a number of experimental techniques 
were hastily contrived to support the data collection. 

The proper diving equipment must be made available to sunport the 
collection effort. Diver comfort in cold water requires the addition 
of heated suits to prevent changes in speech due to loss of heat. The 


orientation problem, discussed later, calls for a helmet or hood tvpe 
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lantern. The type developed for Sea Lab III by Ben Saltzer of Naval 
Undersea Warfare Center, Pasadena, California is recommended. Short 
term loan of personal equipment and use of outdated equipment comnli- 
cates the overall effort and reduces efficiency. 
2. Tape Recorder 
The tane recorder deserves special mention; the choice is 
critical. The PI 6200 was chosen because it could be adonted for 
dc power and provided the necessary linear frequency translation due 
to the slow sample rate. The PI 6200 tape recorder is delicate and 
required special handling, but is suitable for use at a fixed recording 
Sie. 
Seenance Oca llon 
The range location is critical. The attempt to set un a range 
in Monterey Bay during the worst weather in years cost many man hours. 
The range was completed and used; however theft by local fishermen 
neqated the effort. If Monterey Bav is to be used in the future (it 
should not!) the summer months provide the best ocean conditions, and 
notices should be out out to all fishermen asking them to coonerate in 
avoiding destruction of the range. The Monterey Harbormasters Office 
will assist in this respect. Time and effort can be saved if a better 
location is used, specifically, Stage I at NSDRL, Panama Citv, Florida. 
4. Diver and Receiving Transducer Orientation 
Diver orientation must be closelv controlled as must the 
orientation of the receiving transducer. The luna cavities and wet 
suit cause the transmission of the communicators to be highlv direc- 


tional. If the diver is not oriented proverly, the sianal strenath will 
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be reduced; similiarly the WOQC-1A transducer is directional, so must 
be oriented to provide maximum signal reception. In Monterey Bav the 
divers orientation was fixed by a nlatform allowing him to sit down. 
No attemot was made to orient the transducer in the ocean ranqe. At 
San Vicente Reservoir the granite walls and concrete dam face required 
that the receiving transducer be oriented to nreclude reflections. The 
diver was oriented using a wrist comnass to maintain the nroner line to 
the receiving transducer. The diver held a flashlight in one hand to 
assist in reading a word list in total blackness, while he clung to a 
safetv line to control his depth and location. It 1s no wonder the 
result was fading due to diver drift from the nroner diver/transducer 
line. A setun of the DICORS type would materially assist in solving 
the orientation problem [5]. 
5. Safety 
Additional divers, preferably with a technical backaround, are 
needed for safety and convenience. Throughout this research, the divers 
violated normal safety standards by diving alone. This left one man 
at the surface to control and monitor the myriad of controls for re- 
cording. The ability of the tonside man to monitor the data collection 
with confidence was hopeless. 
Gee Pur ous Noises 
The attenuation of noises associated with tane recorder heads, 
interference of 60 cycle nature, surface and bottom reflections must 
be constantly monitored. Data should be recorded only if optimum 
conditions exist. 
7. Forced Speech 
The data collection must support the data vorocessina. The verv 


fact that two independent projects were undertaken reduced the 
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effectiveness of either project. The method of collecting data for 
intelligibility tests is vastly different than that to support the 
Speech processing — the subject of this paper. Intelligibility tests 
are optimized when the diver raises the level of his voice and makes a 
determined effort to speak clearly. However, as mentioned repeatably, 
the task of isolating the problems of distortion by use of the power 
Spectrum requires the diver to speak in the same manner throughout. 
The line spectrum desired is most nearly approached when the voice 

is articulated in a normal manner. This implies that instead of 
forcina the voice on the surface to agree with the technique used for 
clarity at depth, the voice must be used in a normal manner at each 


Stage of the experiment, a reverse of what was done. 


B. DATA PROCESSING 

The processing technique must be thoroughly understood and be used 
successfully prior to the collection of the data. Lack of a hybrid 
interface and limited availability of the underwater communicators 
precluded these conditions being met. 

1. Sample Rate 

The hybrid computer facility should modify the write scheme 

to ensure that the A-D conversion rate is high enough to accommodate 
Speech siaqnals. The desired rate is at least ten kilohertz. The 
analog computer, the COMCOR 5000, is presently rated at 25 kilohertz; 
the limiting item is the diaital computer, the SDS 9300. The limita- 
tion of 2.5 kilohertz forced the conversion of the data at a playback 
Speed of 3./5 ips, one-tenth the record speed. In theory, no error 


is introduced but it is desirable to process at the record speed. 
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Cer WicerinG 
Filtering of the signal prior to A-D conversion should be 
variable and of high quality. The Krohn-hite filter is suitable for 
such work and in many ways 1S superior to a fixed Bandpass filter 
using active devices. A voltage controlled bandpass active filter 
would also be suitable. 
oe Nleenniacing 
Converting the seven track, 24 bit tane to a nine track, 32 
bit tane was a narticular stumbling block, requiring long hours to 
overcome. Appendix B is included to assist in overcoming this 
problem in the future. 
4. Output Format 
The output from the IBM 360 can be displayed in a variety of 
ways; however, the CalComp Offline plotter is most suitable for 
analysis. A more efficient draw routine would make processing easier 


and save compute time and core memory. 
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APPENDIX A GLOSSARY 


PHONEME— one of a finite (40) number of sounds that characterize 


speech. Six major subgrouns (classes) are shown in Table VII. 


FRICATIVE— a noise-like sound, primarily produced by blowing air over 


meme teeth, as in 'S ssn sand ~T..” 


STOP— a sound produced by holding the breath and exnloding it ina 


Bist OnmenOise., asesm <p. ct . and Kk.” 
VOWEL— a steady state sound as in all common vowels. 


GLIDE— a sound formed by continually moving the articulators 


(mouth, tongue, etc.), as in "1", "r", and "w." 
NASAL— "m" and "n"-like sounds. 
DIPTHONG— two or more vowels sounded together 
Pee eds top 
FORMANT— the resonant frequencies of the vocal tract. 


AFFRICATIVE— a nhoneme beginning with a stop and ending with a 


fricative, as "ch" in the word "church." 


PITCH— a qualitative dimension of hearing related to the highness 
or lowness of a tone and the sound intensity (with increased 
intensity, a relative change in pitch can occur with a change 


in frequency. ) 
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TABLE I [21] 


FRICATIVES 


/h/ house 
/f/ fee 
/e/ thin 
/s/ see 
/j/ she 


GLIDES 


/w/ we 
/j/ you 
/r/ read 
/\/ let 


VOWELS (including dipthonas) 


STOPS 

/0/ Shae /b/ he 
(c/o /d/ | day 
/k/ key /9/ go 
NASALS 

/m/ me 

/n/ need 

/n/ song 

/i/ eve /AE/ at 
/1/ oe /i/ ask 
/e/ hate /A/ un 
/é/ met /ei/ say 


/5/ father 


fi! boy 
/pf not 
Jo) eal 
/«f/ obey 


7] 


/V/ 
/o/ 
al 
/3/ 


/u/ 
[ue] 
/ai/ 
/au/ 
/ou/ 


vest 
then 
Z00 
azure 


foot 
boot 
I 


go 


APPENDIX B 


The purnose of the anpendix is to aive a brief but complete 
descrintion of the mechanics involved in digitizing data and inter- 


facina from the SDS9300 to the IBM 360/67. 


A. ANALOG TO DIGITAL CONVERSION 

The analog sianal is the input to the Comcor CI 5000. For all 
Signals, a bandpass or lownass filter network is necessary to nrevent 
aliasing, a major problem. The unper cutoff frequency should be the 
hiahest frequency of interest. The filtered signal is pluqaed into an 
amnlifier input on the CI 5000 and amplified by the highest possible 
factor within the +100 volt limits of the operational amnlifiers. 

This is required to yield the highest possible sianal to noise ratio. 
The output of the amnlifier is the input to Trunkline 500(T500) on the 
analoa patchboard. This links the analog to the digital computer. 

The samplina frequency can be selected un to and includina 2500 
hertz for the present digital magnetic tane write scheme. If the analoa 
clock nulse is used, a selection of 10, 100, or 1000 hertz is available 
on the logic patchboard and should be connected into the Master Clock 
Innut (MC IN). The outnut of the Master Clock (MC) is pluaqqed into 
Trunkline 210 (17210) on the logic natchboard. 

If a variable samnlina frequency is desirable or if a rate differ- 
ent than what is available on the logic board is required, an oscillator 
can be nluaaed into a comnaritor input, (say C03) on the analog natch- 
board. The outnut of the comnaritor on the logic patchboard is pluqaed 
into Master Clock In (MC IN). The output of the Master Clock (MC) jis 


connected to Trunkline 210, (7210). 


ie 


The sampling proqram for the SDS 9300 is titled SAMPL, and was 
developed by Lt. Lynn Dorrian for the SDS 930 [4]. It was modified 
for the SDS 9300 by Lt. Jerry Post and Mr. R. Limes [15]. The proaram 
will samnle one analog sianal. The data is stored in buffers and 
written on maanetic tape from these buffers. It is desianed to fill 
one buffer while the other buffer is dumnina onto the mannetic tane. 
Since the write scheme for the maqnetic tane is inherently slow, the 
sampling rate is presently limited to a maximum of 2500 Hz. If only 
two buffers of data is desirable, this rate can be sliahtly exceeded. 

The procedure is to call the Subroutine SAMPLE(NREC ,NSAMP,IBUFO, 
IBUF1,NTAPE,ITOG) where: 


NREC = number of records/run. 
NSAMP = number of samples/record. 
IBUFO = first data buffer. 

IBUF1 = second data buffer 

NTAPE = magnetic tape unit number 


ITOG = 0 if IBUFO is currently being filled. 
1 if IBUF] is currently being filled. 


After the program has been accented by the SDS 9300 and the parameters 
initialized, the digital comnuter waits for the analoa computer to ao 


to "compute." The compute switch can be manually triaqaered or trionered 
by a control sianal. 

Once the A/D routine commences, it continues until either the analog 
goes out of "compute" or the number of records of data has been taken. 
Insure after the proper number of records have been taken that the key- 
board is put back into reset, if trigqgerina manually. Control returns 
to the diaital program. The digital proaram waits for instructions. 


If Sense Switch 6 is lighted, the program returns to initialization and 


waits for the analog to again ao to compute to diaitize a new run of 


iS 


data. If Sense Switch 5 is liahted, the magnetic tane is rewound to 
load noint. If Sense Switch 4 is lighted, the End of File condition is 
entered on the tape. Sense Switch 3 will cause the iine printer to 
print the last 2 records of data in a print plot routine. 

Enclosure (1) is a diagram of the CI 5000 analog and logic natch- 
board confiauration for the A/D conversion. Enclosure (2) is the 
proaram listing for the Fortran IV routine utilized. 

Below are listed a number of hints to save effort and time. 


1) Use a maanetic tane provided by the IBM 360/67 computer facility 
as it has a nermanently marked load point with the correct spacina. 
The tape unit number is 1] and should be set on "automatic." 


2) Ensure that digital program uses fresh cards, particularly the 
RTM BONT card; a feed check on the card reader is often sensed as a 
program error. 


3) Follow basic 9300 instructions. If the control console fails to 
start proaram feed, turn off the system and start over. This "immediate 
action battle drill" could save many hours. 


4) Use multichannel oscilloscope on the analog console to monitor the 
pre-filtered sianal, the post-filtered sianal and the amplified sianal. 
This will allow a visual check to ensure proner analoa action. 


5) Monitor hybrid interface console durina diaitizina to ensure 
that the system is working. 


6) Make at least the first two and last two runs from a known analog 
Sianal (known frequency and amplitude). This is necessary to ensure the 
conversion to the IBM 360 is complete and correct. An additional sua- 
aestion is to insert a run of known data neriodically throughout the 
cata of interest to also allow a check of the conversion. 


7) Read NPGS Computer Facility Technical Note 0211-64 prior to 
attemntina A/D conversion. 
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B. CONVERSION TO THE IBM 360/67 
The proaram used to convert from seven track, 24 bits to nine track, 

32 bit data is listed in Fnclosure (3). The proaram is both Fortran IV 
and assembly lanquage. The control cards are listed below. 
Card No. 

1 //Name (standard job card) ... 

Zs //FOURIER EXEC FORTCALG,TIME.GO=05,REGION .GN=160K 

3 J PEORT 4S SIN DDE 


(main FORTRAN IV PROGRAM) 
y* 
4 //ASM.SYSIN DD * 


(ASSEMBLY LANGUAGE PROGRAM) 


9 //GO.FTO2F001 DD UNIT=0C0,VOL=SER=NPSYYY ,LABEL=(,BLP) , a 

6 Ve NISP=(OLD,KEEP) ,DCB=(DEN=0 ,RECFM=F ,BLKSIZE= 
4096) 

/ //GO0.FTO4FO01 DD UNIT=2400,VOL=SER=NPSXXX ,LABEL=(,SL) ,DISP= 
(NEW, KEEP), 2 

8 1A DCB= (DEN=2 ,RECFM=F ,BLKSIZE=4096 ) ,DSNAME= 


Card #1,2,3, are the standard control cards for the NPGS Comnuter 
facility. Card 4 indicates the following proaqram is in assembly lan- 
quage. Card 5 and 6 tell the computer where the seven track data is 
located and its format. For example, the card above says the 7-track 
data is located on tape NPSYYY, to be placed on tape drive 02, the 
label nrocessina is to be bynassed, with fixed format, with blocksize 
equal to 4096 bits, (1024 words). Cards 7 and 8 tell the computer 


where the data is to be transferred. The data is to be written on 
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tape NPSXXX, tape drive 04, with standard label, fixed format in blocks 
of 4096 bits (1024 words). The 1024 words corresponds to the number of 
samples/record in the Subroutine SAMPL on the SDS 9300 when NSAMP=1024. 
Anticipate difficulty in the conversion of the data. The tane 
drives are the weakest link and will require persistence to be success- 
ful. The first record of the data will probably be missed. The known 


data will allow a quick check of the system. 
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DATA 


Enclosure(3) To Appendix B 
THIS ORNGRAM INTERFACES THE SDS 9300 TD THE 184360 


/PAYAQOOS3 JOR (9053,47FP), *MYATT + MSGLFEVEL=1,CL4SS=G 
//7=PURTEF EXEC FRICALGP,TI ME. GO=10, REGION. GO=160K 
I/FOR TS SYSTN PN x 
NIMENSTON INDATA(1024) ,DATA(1024) 
FACTPR=190,0/(2%*31-1) 
DOCWIND 2 
QT WINN 
; No Jel WHERE K IS THE NUMBES OF BLOCKS OF BO@4 Fae 
NP 3) J=1913 
PEAN(2,3,END=40,FRR=50) INDATA 
3 enomatl14(454A4) } 
CALL FO@M(TNDATA) 
Wott, 70) J 
To Enema T €974*,70X,"RECO2D NO.=' IS) 
OI 1 1=1,1924 
VATACT)=ENAA TACT) ESC TOR 
WRITS( 6,65) (DETACT) oI =1 91024) 
£6 ENPMAT(LX,10E10.2) 
MOTTE (4,3) DATA 
en i231 
SO WRITE (6,51) J 
51 EMPMAT (404,5XK,*READ ERROR, RECIRD NU.=', 14) 
31 CONTINUE 
STOP 
£0 WRITE Chis4)). 9 
41 ENP Met (10',5X%, "ENN ME TAPE, RECORD NQ.V=', 14) 
Ap 
ZA 


i. SYS Teen) 


=-ORM START QO 
k SUBROUTINE FORM(TNDATA) 
P4 
‘ ¢ INDATA OF AN ARRAY LENGTH SOECTIFIED BY THE INDEX VELU 
x THIS SUBROUTINE WILL CONVERT 24 RIT BINARY WORDS STORED IN 
, SE TN 32 RIT BINARY WORDS AND PLACE THESE SAME WORKS BA 
x $CK INTO YTNDATA 
x 
*% 
“am 40d 2, 1213) THIS SUBROUTINE CONVERTS 
BALR 6,0 24 BIT BINAPY WORDS Tr 
USTNG *,6 32 BIT WORNS 
USING DATA ,7 
Sp vu 
U 11,=F 1024" THIS IS THE INDEX VALUE 
L 12,0(1) 
n1¢ { 2 NUM(12) 
[° ar 
Semi 202 
cy 2,2 
G1 2,64 
Say ti(‘ié‘a? gD 
<oml 2,6 
SR- 292 
“RAL 2546 
eT 3° NUM(12) 
LA 13,4(12) 
Bea Lt.toge 
iM Ful aged) 
MVT 12013),X'FEFY 
RC? sol, 14 
LAT) NSec Tt 
thy Ns LF 


TER eT OO NN PNET=OC 0» VOL=SER=DIVERS wl ABEL EC) BLIP), 
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$96) 
41/60 .FTOSFOOL D 
SNEW,KEFP) 
ff 
$053 


DI SP=(O0LD,KFEP) ,OCB=(DEN=0 ,RECFM=F,BLKS IZE=40 
D UNI T=2400 » VOL=SER=NPS234 ,LABEL=(ySL)y DISP= 
8cB=(MEN=2 RECEM=F, BLKSIZE= 40°96)» DSNAME=MYAQO 
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COMPUTER PROGRAM HARM 
PROGRAM COMPUTES POWER SPECTRUM FOR 8192 DATA POINTS 
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URIER TRANSFCRM A(T1,12,13)*/(4(1X,* ACT 15, 


2000(0T11, 


1) OO O ONO 


GU To 35 
387 O(1)=B(1) 
OC 988 11=2,125S 
ape) 
[3=11¢l 
DI T1)= 065*B(11)+0.25*(B(12)+B(13)) 
$88 CONTINUE 
CC 989 T1=1,1259 
B(I1)=D(11) 
38S CONTINUE 
WRITE (6,966) 
966 FORMAT(LHO,* SECOND SMOOTHING FUNCTION USED") 
GO TO 990 
35 CONTINUE 
//GC.FETC4F001 DD DSNAME=MYA0053,UNIT=2400,0CB=(RECFM=F,BLKSI 
$7&=4096),. * 


" VOLUME=( PRIVATE, SER=NPS234) 
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